As discussed in my blog post about electoral shocks and COVID-19, polling and economic data served as proxies for any impact of COVID-19 on the 2020 election in my model. Given the importance of economic fundamentals in determining election winners, it is not unreasonable to think that the pandemic’s toll on the economy must have hurt his re-election bid. Considering all that we have learned over the past few months, it seems that Trump’s strong (pre-pandemic) economic record and his incumbent advantage could have reasonably carried him to another 4 years in the White House. In an alternate universe without COVID-19, would Donald Trump have won the 2020 election? In other words, did Trump lose because of COVID-19, or would Biden still have defeated the incumbent president in a COVID-free world?
Given his fairly large popular vote loss in 2016, a Trump victory in the popular vote would have been unlikely even without the pandemic, but an Electoral College victory does not seem out of reach. Several paths existed for a Trump electoral victory even with the pandemic, but none of them panned out for him on Election Night. Donald Trump failed to secure Arizona, Georgia, and Wisconsin in 2020, despite having won all three of them in 2016. These are just a few of several states that were hit relatively hard by the pandemic and flipped from red to blue in 2020.
The stark policy implications of a Trump versus Biden presidency for the next four years warrants exploration of this narrative. Of course, I cannot run a randomized experiment with treatment and control groups to determine the causal impact of COVID-19 on Trump’s vote share. Ideally, we would compare Donald Trump’s vote share in areas hit by COVID-19 to areas untouched by the pandemic. However, the widespread nature of the pandemic makes natural experiments nearly impossible in this scenario. Because we do not have access to the ideal experimental data, this analysis regresses Trump’s 2020 vote share as a function of COVID cases or deaths and 2016 vote share to examine the association between Donald Trump’s vote share and COVID-19.
The 1918 Spanish influenza pandemic provides the only historical event of comparable circumstances and magnitude to the COVID-19 pandemic. Previous research on the 1918 midterms and 1920 general election suggests that the pandemic had a negligible electoral impact.1 However, the national dialogue surrounding the pandemic looks quite different now than it did a century ago. Relative to the magnitude of the pandemic, the Spanish flu received little public attention, which contrasts greatly with how COVID-19 has dominated nearly every facet of life in 2020. So, while we cannot automatically extend the conclusions from the 1918 pandemic to COVID-19, we can apply a similar methodology to take a preliminary look at COVID’s electoral impact.
In Democracy For Realists, Achen and Bartels examined whether the states and cities hit hardest by the pandemic responded differently at the polls.2 While they focused on gubernatorial races during the 1918 midterms, I plan on applying the underlying structure of their regressions to the 2020 data. Similar to Achen and Bartels, I plan on running a simple regression that maps Donald Trump’s 2020 vote share as a function of his 2016 vote share and COVID cases or deaths in the respective geography.3
This regression draws a mixture of COVID, population, and voting data from several sources:
Achen and Bartels centered their analysis around excess flu deaths and vote share in the previous election. Following their lead, my initial regression mapped Trump’s 2020 state-level vote share from his 2016 vote share and total COVID deaths up to Election Day as a percentage of that state’s population. The p-value for the COVID deaths coefficient was quite large; the same regression with cases instead of deaths yielded a slightly more significant p-value, but still not enough to declare statistical significance. Both of these coefficients were positive, but the large p-values make it difficult to draw any conclusions from the regression. Perhaps the pandemic was unlikely to sway voters in heavily partisan states but was more of a deciding factor in battleground states?
Sure enough, the state-level regressions yielded more significant coefficients for the COVID terms when focusing solely on battleground states. The most significant coefficient was when looking at COVID cases in battleground states, which had a p-value of \(0.086\). If we want to interpret this coefficient with a significance level of \(\alpha = 0.10\), we can say that we have sufficient evidence that a 1% increase in a battleground state’s case count as a percentage of the population is associated with an approximate increase of \(0.57\)% of Trump’s 2020 two-party vote share within that state.
| All States and COVID Deaths | All States and COVID Cases | Battleground States and COVID Deaths | Battleground States and COVID Cases | |||||
|---|---|---|---|---|---|---|---|---|
| Predictors | Estimates | p | Estimates | p | Estimates | p | Estimates | p |
| (Intercept) | 0.98 | 0.497 | 1.36 | 0.282 | 2.04 | 0.647 | 5.62 | 0.206 |
| Trump’s 2016 Two-Party Vote Share | 0.95 | <0.001 | 0.93 | <0.001 | 0.90 | <0.001 | 0.83 | <0.001 |
| Deaths as Percent of Population | 3.36 | 0.644 | 26.03 | 0.141 | ||||
| Cases as Percent of Population | 0.28 | 0.276 | 0.57 | 0.086 | ||||
| Observations | 50 | 50 | 16 | 16 | ||||
| R2 / R2 adjusted | 0.972 / 0.971 | 0.973 / 0.972 | 0.905 / 0.890 | 0.911 / 0.897 | ||||
The significance of that term depends on how you select your \(\alpha\) value. Even with a p-value of \(\alpha = 0.05\) that deems the results insignificant, the regression indicates the possibility of a positive association between COVID cases and Trump’s 2020 vote share in battleground states when controlling for Trump’s 2016 vote share.
Next, I wanted to extend the analysis one step further and run the same regressions with county-level COVID metrics and vote shares. Again, all of the coefficients indicated a positive relationship between Donald Trump’s 2020 vote share and an increase in COVID cases or deaths as a percentage of the county’s population. This time, all of the slope coefficients yielded significant p-values at an \(\alpha = 0.001\) significance level, confirming the positive association: ADD NOTE ABOUT INCREASING SAMPLE SIZE AND DECREASING P-VALUE
| All Counties and COVID Deaths | All Counties and COVID Cases | Counties in Battleground States and COVID Deaths | Counties in Battleground States and COVID Cases | |||||
|---|---|---|---|---|---|---|---|---|
| Predictors | Estimates | p | Estimates | p | Estimates | p | Estimates | p |
| (Intercept) | -1.40 | <0.001 | -1.19 | <0.001 | 0.73 | 0.057 | 1.18 | 0.004 |
| Trump’s 2016 Two-Party Vote Share | 1.00 | <0.001 | 1.00 | <0.001 | 0.97 | <0.001 | 0.97 | <0.001 |
| Deaths as Percent of Population | 11.22 | <0.001 | 14.94 | <0.001 | ||||
| Cases as Percent of Population | 0.30 | <0.001 | 0.28 | <0.001 | ||||
| Observations | 2978 | 2978 | 1252 | 1252 | ||||
| R2 / R2 adjusted | 0.975 / 0.974 | 0.974 / 0.974 | 0.964 / 0.964 | 0.961 / 0.961 | ||||
The below plots provide a clearer visualization of this positive association between COVID-19 and Trump’s 2020 vote share within battleground states. However, it is important to note that these plots do not control for lagged vote share like the regression does. While I attempted to illustrate the multivariate relationship between the variables by changing the size of the points on the state plot and faceting the county plots, the plots still fail to show the complete picture of the regression since the 2016 vote share is a continuous variable. Controlling for 2016 vote share in the regression yields an even stronger relationship between Trump’s 2020 vote share and COVID counts than displayed by the below graphs, but at least they provide a rough visualization of the relationship:
Taking an interest in the results of the regression, I decided to take a more nuanced look at the implications of these findings. The regressions take very crude measures of COVID numbers and previous vote share, without considering possible confounding variables. My previous election model used a mixture of demographic variables, economic metrics, incumbency status, and polling numbers to produce a probabilistic forecast for the 2020 election. While the forecast did not match the election results exactly, it did match the outcomes fairly closely, so it would not hurt to examine what happens without any measured impact of COVID-19.
COVID-19 bled into the polling and economic data used for the predictions, so I took steps to try to erase or minimize any impact of COVID-19 on these metrics:
I used a very similar5 model equation to that from my final forecast. In this hypothetical, pandemic-free world, Trump lost both the Electoral College and the national two-party popular vote by an even larger margin than what panned out on the actual election day:
| Candidate | Electoral Votes | Two-Party Popular Vote |
|---|---|---|
| Biden | 349 | 53.265 % |
| Trump | 186 | 46.735 % |
The pervasive nature of COVID-19 eliminates the possibility of a natural experiment, which makes a clear test of the impact of COVID-19 on the presidential election nearly impossible. The above regressions and hypothetical forecast do, however, indicate that COVID likely did not hurt–and quite possibly could have helped–Donald Trump in his 2020 election bid. The regression does not control for demographic and socioeconomic changes in the past four years, but Trump’s 2016 vote share does control for the state or county’s position in the prior election. While the above tests do not allow us to conclude that COVID-19 caused Trump to perform better in the 2020 election, these preliminary measures do support the likelihood of a positive association between COVID numbers and Donald Trump’s vote share.
While counterintuitive and the exact opposite of my initial hypothesis, this finding is not the first suggestion that COVID may have boosted Trump’s support in the 2020 election. Perhaps lockdowns following local outbreaks angered voters who support the idea of small government? This hypothesis does not seem out of reach given the protests and kidnapping plots that emerged in response to Michigan’s crackdown on COVID cases earlier this year.
The possibility of a positive association between Trump’s vote share and the impact of COVID-19 warrants further exploration. While I certainly hope that any future research on the topic would not need to be applied to any global pandemics in the near future, any lessons from this election will help to enhance our general understanding of how voters respond to large-scale crises.
[Achen and Bartels, 2017] Achen, C. H. and Bartels, L. M. (2017). Democracy for realists: Why elections do not produce responsive government↩︎
[Achen and Bartels, 2017] Achen, C. H. and Bartels, L. M. (2017). Democracy for realists: Why elections do not produce responsive government↩︎
Achen and Bartels also included a dummy variable that indicated whether the specified gubernatorial candidate was a Democratic incumbent. Since Donald Trump ran in both 2016 and 2020, I did not include an indicator for incumbency and instead just focused on his vote share for both races.↩︎
While this includes some data from after COVID-19 came to the United States, I had to expand the window of time to get a large enough sample size for the model to run for each state.↩︎
I added an interaction term to the model from my original forecast for this iteration. In retrospect, it did not make sense to not include it in the first place since the state of the economy likely has opposite effects for incumbent and non-incumbent candidates. The final equation used to predict \(\hat{y}\), the probability of voting for the Democratic or Republican candidate within each state, in this updated model is \(\hat{y} = \text{incumbent} * \text{gdp_growth_qt} + \text{avg_state_poll} + \text{prev_dem_margin} + \text{black_change} + \text{age20_change} + \text{age65_change}\)↩︎